Web robot detection in the scholarly information environment
Authors
Abstract
An increasing number of robots harvest information on the World Wide Web for a wide variety of purposes. Protocols developed at the inception of the web laid out voluntary procedures for identifying robot behaviour and excluding it if necessary. Few robots now follow these protocols, and it is increasingly difficult to filter their activity out of reports of on-site activity. This paper demonstrates the issues involved in identifying robots and assessing their impact on usage, drawing on a project that sought to establish the relative usage patterns of open access and non-open access articles in Glycobiology, a journal published by Oxford University Press that offers articles in both forms within a single issue. A number of methods for identifying robots are compared; together, these methods found that 40% of the raw logs of this journal could be attributed to robots.
Related resources
A Navigation System for Autonomous Robot Operating in Unknown and Dynamic Environment: Escaping Algorithm
In this study, the problem of navigation in a dynamic and unknown environment is investigated and a navigation method based on a force field approach is suggested. It is assumed that the robot performs navigation in...
Motion detection by a moving observer using Kalman filter and neural network in soccer robot
In many autonomous mobile applications, robots must be capable of analyzing the motion of moving objects in their environment. While the robot is moving, image quality is degraded by camera shake, which causes high errors in image processing outputs. In this paper, we propose a novel method to effectively overcome this problem using Neural Networks and Kalman Filtering theory. This technique u...
Development of RadRob15, A Robot for Detecting Radioactive Contamination in Nuclear Medicine Departments
Accidental or intentional release of radioactive materials into the living or working environment may cause radioactive contamination. In nuclear medicine departments, radioactive contamination is usually due to radionuclides which emit high energy gamma photons and particles. These radionuclides have a broad range of energies and penetration capabilities. Rapid detection of radioactive contami...
Mobile robot wall-following control using a behavior-based fuzzy controller in unknown environments
This paper addresses a behavior-based fuzzy controller (BFC) for mobile robot wall-following control. The wall-following task is usually used to explore an unknown environment. The proposed BFC consists of three sub-fuzzy controllers, including Straight-based Fuzzy Controller (SFC), Left-based Fuzzy Controller (LFC), and Right-based Fuzzy Controller (RFC). The proposed wall-...
A density based clustering approach to distinguish between web robot and human requests to a web server
Today, the world's dependence on the Internet and the emergence of Web 2.0 applications are significantly increasing the need for web robots to crawl sites in support of services and technologies. Despite their advantages, robots may consume bandwidth and reduce the performance of web servers. Despite a variety of research, there is no accurate method for classifying huge data ...
Journal title:
- J. Information Science
Volume 34, Issue
Pages -
Publication date 2008